Automatic Semantic Classification for Chinese Unknown Compound Nouns
Abstract
The paper describes a similarity-based model for representing the morphological rules of Chinese compound nouns. This representation model serves three functions: 1) it encodes the morphological rules of the compounds, 2) it provides a means of evaluating the appropriateness of a compound construction, and 3) it provides a means of resolving the semantic ambiguity of the morphological head of a compound noun. Based on the model, an automatic semantic classification system for Chinese unknown compounds is implemented. Experiments and error analyses are also presented.
Similar resources
Semantic Classification of Chinese Unknown Words
This paper describes a classifier that assigns semantic thesaurus categories to unknown Chinese words (words not already in the CiLin thesaurus and the Chinese Electronic Dictionary, but in the Sinica Corpus). The focus of the paper differs in two ways from previous research in this particular area. Prior research in Chinese unknown words mostly focused on proper nouns (Lee 1993, Lee, Lee and C...
Semantic Classification of Automatically Acquired Nouns using Lexico-Syntactic Clues
In this paper, we present a two-stage approach to acquire Japanese unknown morphemes from text with full POS tags assigned to them. We first acquire unknown morphemes, making only a morphology-level distinction, and then apply semantic classification to the acquired nouns. One advantage of this approach is that, at the second stage, we can exploit syntactic clues in addition to morphological ones bec...
An Analysis of Persian Compound Nouns as Constructions
In Construction Morphology (CM), a compound is treated as a construction at the word level with a systematic correlation between its form and meaning, in the sense that any change in the form is accompanied by a change in the meaning. Compound words are coined by compounding templates which are called abstract schemas in CM. These abstract constructional schemas generalize over sets of existing...
Semantic Labeling of Compound Nominalization in Chinese
This paper discusses the semantic interpretation of compound nominalizations in Chinese. We propose four coarse-grained semantic roles of the noun modifier and use a Maximum Entropy Model to label such relations in a compound nominalization. The feature functions used for the model are web-based statistics acquired via role related paraphrase patterns, which are formed by a set of word instance...
A Study on Semantic Word-Formation Rules of Chinese Nouns from the Perspective of Generative Lexicon Theory: A Case Study of Undirected Disyllable Compounds
This paper applies qualia structure theory to the study of compound nouns whose lexical meanings cannot be inferred from their morpheme meanings, taking as examples some undirected disyllabic compound nouns from the Chinese Semantic Word-formation Database. The paper identifies some specific ways in which morpheme meanings can be integrated with lexical meanings. It is hoped that the...